Balancing Memory Load in D-CAPE Using Replication-Style Distributed Partitioning of Operator States
نویسندگان
چکیده
Distributed continuous stream query systems have been shown to meet the needs of a range of high-volume applications. However, in some applications the stream queries for these applications may require a wide range of sizes of allocated memory. In such cases it may not be feasible to continue to store the data for any of the largest operators on a single machine. In addition, for the benefit of the users of such a system, we would like to offer as much flexibility in query processing as possible. This includes making full use of resources on the network, even with a highly fluctuating volume and distribution of data. This thesis proposal discusses an unexplored method of partitioning a query operator across several machines, without predetermining a destination for each new input for the operator. The proposed extension consists of the ability of the D-CAPE distributed continuous stream query system to activate a query plan operator on several machines simultaneously, and replicate new input data to each of these operators. Experiments will be conducted to observe the limits of high-volume query processing in D-CAPE with and without these new extensions, and to experimentally optimize these limits through the manipulation of several internal options.
منابع مشابه
Distributed Graph Layout for Scalable Small-world Network Analysis
The in-memory graph layout or organization has a considerable impact on the time and energy efficiency of distributed memory graph computations. It affects memory locality, inter-task load balance, communication time, and overall memory utilization. Graph layout could refer to partitioning or replication of vertex and edge arrays, selective replication of data structures that hold meta-data, an...
متن کاملA Parallel Algorithm to Calculate the Costrank of a Network
We developed analogous parallel algorithms to implement CostRank for distributed memory parallel computers using multi processors. Our intent is to make CostRank calculations for the growing number of hosts in a fast and a scalable way. In the same way we intent to secure large scale networks that require fast and reliable computing to calculate the ranking of enormous graphs with thousands of ...
متن کاملTowards Dynamic Load Balancing Using Page Migration and Loop Re-partitioning on Omni/SCASH
Increasingly large-scale clusters of SMPs continue to become majority platform in HPC field. Such a cluster environment, there may be load imbalances due to several reasons and mis-placement of data which bring performance bottlenecks. To overcome these problems, some dynamic load balancing mechanisms are needed. In this paper, we report our ongoing work on dynamic load balancing extention to O...
متن کاملDynamic Load Balancing Using Space-Filling Curves
Space-filling curves (SFCs) provide a continuous mapping from a one-dimensional to a d-dimensional space and have been used to linearize spatially distributed data for partitioning, memory management and image processing. This paper gives a short introduction to the concept of SFCs and highlights some of the applications found in computer science. The main focus will be on dynamic distribution ...
متن کاملLoad Balancing Problem for Parallel Computers with Distributed Memory
This paper deals with load balancing of parallel algorithms for distributedmemory computers. The parallel versions of BLAS subroutines for matrix-vector product and LU factorization are considered. Two task partitioning algorithms are investigated and speed-ups are calculated. The cases of homogeneous and heterogeneous collections of computers/processors are studied, and special partitioning al...
متن کامل